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i^i Abstract 

A well-established model for the genealogy of a large population in equilibrium is Kingman's 
coalescent. For the population together with its genealogy evolving in time, this gives rise 
to a time-stationary tree-valued process. We study the sum of the branch lengths, briefly 
denoted as tree length, and prove that the (suitably compensated) sequence of tree length 
processes converges, as the population size tends to infinity, to a limit process with cadlag 
paths, infinite infinitesimal variance, and a Gumbel distribution as its equilibrium. 
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1 Introduction 



Kingman's coalescent (5UJ[5T] is a widely used model for the single-locus genealogy in a population, 
see [30| and references therein. It arises in a suitable rescaling of time under the assumptions 
of a neutral evolution and an exchangeable reproduction dynamics with short-tailed offspring 
distribution. An intuitive way to think of Kingman's coalescent is to imagine a random tree with 
^-j- infinitely many leaves at time i, where backwards in time any two lineages independently coalesce 

("s^ at rate 1. Taking N instead of infinitely many leaves gives Kingman's N-coalescent. The latter 

figures as the genealogy of an N-sample taken from a large population, and also as the genealogy 
of the total population in a standard Moran model with population size N. 
Q\ Two functionals of coalescent trees are of particular interest: the distance from the root to the 

leaves, or depth, and the sum of branch lengths, or tree length. It is well known that the expected 
depth of Kingman's N-coalescent equals 2(1 — ^), whereas its expected tree length is ~ 21ogN 
as N — > oo. More can be said: when compensated by 21ogN, half the tree length of Kingman's 
N-coalescent converges in law to a Gumbel distributed random variable (having the cumulative 
distribution function x t— > e~ e ). This result can be read off from [551 P- 153, first equation]; see 
also [35], EH, [H] and [3D]. 

With individual offspring distributions that are not short-tailed, coalescents different from 
Kingman's appear as the genealogies of large populations. In the so-called A-coalescents [53], 
more than two lines can coalesce, giving rise to multiple mergers, and asymptotic tree length 
distributions arise that are different from Gumbel distributions. For special classes including 
those of Beta-coalescents, results on the asymptotic tree length were obtained in [35], [15], [3] and 
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With a population evolving in time, its genealogical relationships evolve as well. Their evolution 
is described by a tree-valued process |19j similarly as the change of allele frequencies is captured 
by measure- valued diffusions [JJ Q3] . Jumps of the tree depth correspond to the loss of one of the 
currently two oldest families from the population, and hence to the establishment of a new most 
recent common ancestor (MRCA) of the population. The resulting tree depth process in the case 
of Kingman's coalescent was analyzed in [23] and [9]. 

In the present paper we focus on the (compensated) tree length in Kingman's coalescent and 
describe its evolution in an infinite population. Our main result is that this process has cadlag 
paths and infinite infinitesimal variance (Theorem [I]). As already stated, the one-dimensional 
projections of this process are Gumbel distributed. 

We construct the process of compensated tree length, denoted by Jzf , as a limit using tools 
from weak convergence of processes. In addition, we also provide a strong convergence result, i.e. 
a version of Theorem [T] in terms of convergence towards Jzf in probability. For this, we use the 
lookdown process introduced in jllj , which provides genealogies of Moran models of any population 



size on one and the same probability space. Our Proposition 3.2 shows that on this space, the 
compensated Kingman tree lengths lead to a cadlag path-valued limit in probability. Hence, the 
process Jzf can be defined directly in terms of a sequence Moran models - or in terms of the 
lookdown graph - and as such is a natural object to study. Some challenging questions remain, 
e.g. a) Is the limit robust in the sense that Jzf describes also the limiting tree length process for (a 
large class of) Cannings models with short-tailed offspring distributions? b) Is there an intrinsic 
characterization of Jzf in terms of a stochastic dynamics? In particular, is Jzf a semimartingale? 

The length of a coalescent is of relevance in empirical population genetics |30j . In the infinite 
sites model the number of mutations seen in a population of size N at time t is Poisson distributed 
with parameter proportional to tree length and to the mutation rate. The process of tree lengths 
has also attracted interest in the study of diversity in real populations [33 Fig. 2c] . There, sudden 
losses of diversity in a population are related to jumps of the tree length process. Such jumps 
occur at any resampling event and correspond to the length of an external branch breaking off the 
tree. The asymptotics of external branch lengths are investigated in detail in [BJ, see also Remark 
H! and Section 1421 



Our paper is organized as follows. After specifying the model we present our results on weak 
(Section^ and strong (Section |3| convergence of tree lengths and tree length processes. In 
Section |4] we provide some auxiliary results on Kingman's coalescent for fixed times and on Moran 
models. Section [5] completes the proof of Theorem [T] and Section [6] contains the proofs of the 
strong convergence results, Propositions |3.1| and |3.2| 



2 Convergence of tree length distributions 

Consider a Moran model with constant population size N, started at time — ooQ Each (unordered) 
pair of individuals resamples at rate 1; in any such resampling event, one of the two individuals 
reproduces and the other one dies. See Figure [T|A) for an illustration. 

At any time t£R, the common ancestry of all individuals in the population is described by a 
random genealogical tree, which is Kingman's N-coalescent [20]. With time t varying, we obtain 
a tree- valued process denoted by = (^ N ) t£ R, whose random path we can read off from the 
graphical representation, see Figure [ljB) and (C). 

Let I be the map that sends a (finite) tree to its length, i.e. to the sum of the lengths of 
all branches. Back from a fixed time t, each (unordered) pair of ancestral lines coalesces at 
rate 1, therefore the length of the time interval during which the genealogical tree >3^ N has k 
lines is exponentially distributed with the number of pairs, , as parameter. Consequently, the 



1 We use roman upper case letters K,N,... for real (non-random) numbers in order to distinguish them from 
random variables denoted by K, N, .... 
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Figure 1: (A) The graphical representation of a Moran model of size N = 5. Arrows between 
lines indicate resampling events. By resampling the genealogical relationships between individuals 
change. The individual at the tip dies and the other one reproduces. (B) The genealogical 
relationships at time s can be read off from the graphical representation. The vertical bold lines 
constitute the genealogical tree ^ N . (C) By time t the genealogical tree has changed to 
However, some parts of the tree 3? t N were already present in the tree ^ N . 



expectation and the variance of the tree length are 

N N-l 



N-l 



i=2 \2J i=l i=2 \ 2 ) 



i=l i=2 

We are going to study the compensated tree length process 

Jf N := (7(^ N )-21ogN) teR 



(2.1) 



in the limit N — > oo. 

One realization of the process Jzf 100 can be seen in Figure [2j Here, several large jumps of the 
tree length can be observed. Particularly large jumps in the tree length arise when the MRCA of 
the total population changes. 



2.1 Tree lengths at fixed times 

We recall a basic fact about the asymptotics of the law of «if t N as N — > oo. 

Proposition 2.1 (Tree lengths for fixed times). For i 6 i, the law of converges as N — > oo 

weakly to the standard Gumbel distribution with cumulative distribution function ii->e~ e 

Proof. We briefly repeat the argument from |32( p. 255]. Let X 2 ,X3, ... be independent random 
variables such that Xj has an exponential distribution with rate (^)- In addition, let Y\, Y 2 , ... be 
independent such that Yj has an exponential distribution with rate j and Z\, Z 2 , ... be independent 
exponential, each with parameter 1. Then 

N N-l 

m N ) = !E;*i =E^= ^ ^ 

which when shifted by logN has the asserted limit in distribution as N — > oo. □ 
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Figure 2: One realization of the tree length process J2? 100 started at time t — in equilibrium. 
The arrows point to those jumps of the tree length at which the MRCA of the total population 
changes. 



2.2 The evolution of tree lengths 

Next, we come to our main result on the limit of the compensated tree length processes J2? N . We 
denote by D the space of real-valued cadlag functions on the time axis (—00,00), equipped with 
the Skorokhod topology. 



Theorem 1. There is a process Jz? = (J&tjt&n with sample paths in . 

J^ N if as N -> 00. 



such that 



The distribution of |j2? t is Gumbel for all t£R. The process has infinite infinitesimal variance, 
with 



t log* 



' E[(J2$ - J%) 2 ] ^ 4. 



(2.2) 



Remark 2.2 (Connection to external branch lengths). An important characteristics of the tree 
length process is its jump size statistics. For this, let F be a randomly chosen jump time for the 
equilibrium process if N . By the independence properties of the Poisson processes which generate 

5^ N , we have &p_ = £^ and consequently Jz?J?_ = Jzf N . Moreover, the jump removes a randomly 
chosen external branch from £^p_. Since the N-coalescent, restricted to N — 1 randomly chosen 

individuals, is in distribution identical to the (N — l)-coalescent, it follows that £?p = ^, N_1 and 



'0 

consequently jSfj? = if w_i + 2 log (l — ^). Moreover, the jump size, given by Jfp_ — J£p, is 



in distribution identical to a randomly chosen external branch of a N-coalescent in equilibrium. 
Properties of the external branch length distribution are recalled in Section |4.2| and were studied 
in more detail in [BJ. For our setup, these results imply 



j 



for some random variable J, taking values in the positive reals with expectation 2 and density 
x H> 8/(2 + x) 3 . This power law with exponent 3 was already guessed in [25] based on simulations. 
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Figure 3: Schematic picture of tree change between two times and t for a population of size 
N. Tree topology is ignored in the figure and only the number of ancestral lines is given. The 
population at time t has Sf ancestors at time 0. The genealogical tree at time t overlaps with 
the tree at time 0. The subset of the time-t tree which does not belong to the time-0 tree is 
Aqj. Reversely, the subset of the time-0 tree which is lost between time and t is B^ t . The net 

difference in tree length is Jf t N - J^ N = A$ t - BgJ t . 



The random variable J has unbounded variance. For the asymptotics of this variance, [T5] already 
showed (see also Proposition 4.2 ) thalj^] 



V[N(^ N _ - JSf£)] 1 ~ 81og(N). (2.3) 

Remark 2.3 (Heuristics on jump sizes). The fact that the approximate size of a randomly chosen 
jump is of the order 2/N can also be seen from the dynamics of Jz? N . In one time unit, tree length is 
gained by growth of the tree at constant speed N. Moreover, the process makes approximately 
( 2 ) jumps. Since Jz? N is in equilibrium, the tree growth and the jumps have to compensate each 
other. Therefore the expected size of a single jump must be N 2 _ 1 ; compare also with Proposition 



In the light of (2.3), the fact that the limit process Jzf has infinite infinitesimal variance would 
not be surprising if there were no dependencies between jump sizes: In a short time t, the process 
J*f N makes approximately ( 2 )i downward jumps as N — > oo. If jumps would be independent, we 



would get from (2.3) that the variance of Jzf t — Jz? is approximately 4ilogN. 



Remark 2.4 (Idea of the proof of Theorem [T]). The crucial step in the proof, whose details are 
given in Section [HJ is to establish tightness of the family <5? N . This uses auxiliary calculations 
on the evolution of Moran models (Section |4.6[ ): it suffices to show that 'large' jumps in the tree 
length do not happen too often. To be more specific, we must show that during times (t — h, t] 
and (t, t + h] some moment of the smaller jump, S£f — S£^_ h or S£f+ h — Jz? t N , is bounded by Ch 6 
for some constant C and 6 > 1. Heuristically, such a statement is true since it can be shown 
that the times at which one of the / oldest fami lies of the coalescent tree dies out build a Poisson 



process with rate ({): / — 2,3, .. . (see Lemma 4.12). Dying out of one of the / oldest families 



implies larger jumps for smaller / and the proof of tightness requires bounds for / depending on 



2 For sequences (on)n =1 2,... and (&n)n=i 2 we write N ~°° 6 N iff a-^/b-^j N ^°°> 1. 



2 CONVERGENCE OF TREE LENGTH DISTRIBUTIONS 



6 



the time interval h, using that loss times for one of the / oldest families in (t — h, t] and (t, t + h] 
are independent. 

To obtain the form for the infinitesimal variance, it is essential to bound jumps between times 
and t of Jzf N for small i, uniformly in N; see Fig ure |3| for an illustration. Our proof is based 



on auxiliary calculations made in Sections 4.3 and |4.4| Note that changes in Jzf N come from 
two sources. First, between and t, additional tree length is gained by tree growth (Aq t in the 
figure). The random variable Aq t equals the tree-length of a Kingman N-coalescent gained by 

time t and for large N, we see from Lemma 4.10 that liniN-j-oo V[Jio t] '~ Second, a part of 
the tree at time t breaks off (B^ t in the figure). This part is determined by the number Sf of 
ancestors at time of the population at time t. Additionally, note that the number of ancestors 
at time t of the populat ion of size N converges in distribution to some random variable S t with 



St *~° LfJ; see Lemma 4.6 Denoting by _Bk the difference of the compensated tree length of 



a coalescent with infinitely many lineages and the compensated length of the tree spanned by 

~°° V[B St ] 

Combining the results for Aq t and -B^ t we 



a subset of K lineages, using some regularity, we can then show that V[B^ t ] N ^°° V[i?5 t ] 



[B 2 J "~° 4<log | " u 4t| logt\ by Proposition 

L t J 



see that liniN^ V[J? t N - Jzf N ] 4t| logt|. 

Remark 2.5 (Convergence of tree- valued processes). In [15] , a topology r on the space of trees is 
specified and it is proved that the sequence of tree- valued processes ^ N converges in distribution 
to a tree-valued process 27 , whose paths are a.s. continuous with respect to the topology r. One 
might be tempted to use convergence of to 3* in order to show that Jzf N converges to some 
limit process Jzf as well. Such an attempt would require that the function I mapping finite trees 
to their (compensated) lengths is r-continuous. However, if t would be r-continuous we would 
conclude that Jzf has continuous paths, but Jzf clearly makes jumps. Hence, £ is not continuous 
and convergence of cannot be used to show convergence of Jzf N . 

Remark 2.6 (Extension to A-coalescents). In the past decade, the so-called A-coalescents 24 
have gained increasing interest (see e.g. [4] and references therein). Each of these coalescent pro- 
cesses arises as the large population limit of the genealogy of Cannings models and is uniquely 
determined by a finite measure A on [0; 1]. The Kingman coalescent then arises for A = <5o- For 
A =/= <5o , the underlying Cannings models have unbounded variance and the A-coalescents admits 
the possibility of more than two lines merging at the same time. An interesting direction for 
further research is the investigation of potential limits of the tree-length process of the respective 
genealogies of Cannings-models. For this, recent results by [22], [12], [3] and [8] on moments and 
rescalings of A-coalescent trees, and by [5] on the speed of coming down from infinity (which ex- 
tends Aldous' result for the Kingman case, Lemma 4.6 below, to the A-case) will provide important 
ingredients. 

Remark 2.7 (Connection to empirical population genetics). Coalescent trees are of particular 
importance in empirical population genetics and in the analysis of sequence diversity data. In the 
infinite sites model, mutations leading to segregating sites fall on the genealogical tree at constant 
rate. As a consequence, the number of segregating sites is Poisson distributed with a parameter 
proportional to the tree length. As illustrated by Figure [2] the tree length process makes jumps. 
Particularly large jumps occur when the most recent common ancestor of the total population 
changes. At such a time F, one of the two oldest families in the population dies out and a long 
external branch breaks off the genealogical tree (see also [17]). At time F— there are several 
segregating sites which are carried by all individuals which belong to the family which does not 
die out. Such segregating sites become fixed in the population when the MRCA of the population 
changes. In particular, fixation of segregating sites (also denoted by substitutions) come in bursts 
as time evolves, an observation already made by [3T]. In addition, segregating sites which are 
present only in the oldest family which dies out at time F, are lost. 

Observations concerned with the substitutions of segregating sites are special properties of the 
mutation- drift balance. This dynamic equilibrium is between the introduction of new segregating 



4.4 
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Figure 4: Kingman's temporal coupling: In the tree T coming down from infinitely many leaves, 
a tree with N leaves can be embedded by considering the part of T which is below the time at 
which T comes down to N lines. The resulting tree length is Af . 



sites due to mutation and loss of present ones due to genetic drift. Considered between times 
and t, the introduction of new mutations in the population are due to mutation events falling on 
the part of the genealogical tree gained between times and t while the loss of existing mutations 
is due to some part of the genealogical tree at time breaking off by time t. Most interestingly, 
the number of segregating sites in the total population, unlike many other processes in population 
genetics, is 'super-diffusive' in that it has infinite infinitesimal variance, as stated in Theorem [l] 



3 Strong convergence of tree lengths 

Proposition |2.1| and Theorem [T] establish convergence in distribution for the real- valued random 
variables Jz? t N (t fixed) and the D- valued random variables Jzf N . We extend these results by stronger 
notions of convergence, i.e. convergence in probability, almost sure convergence and convergence 



in L 2 . We start with extensions for fixed tim es (Proposition 3.1 ) and then come to the extension 
involving the processes Jz? N (Proposition 3.2). The proofs are given in Section [6] 



3.1 Tree lengths at fixed times 

For the extension of Proposition |2.1| fix t E R. An elegant way to encode a random coalescent 
tree T := 2?t is in terms of (the completion of) a random metric on N, as proposed by Evans |17j . 
To visualize this, consider a sequence of lineages indexed by N, where lineage i starts at time 
in leaf i. Any pair of lineages coalesces independently at rate 1, and a random (ultra) metric R is 
defined by 

R{hj) := ^ • time to the most recent common ancestor of leaves (3-1) 

The completion of (N, R) is a.s. a compact ultra-metric space that represents the (uncountable set 
of) leaves of the coalescent tree. 

There are two canonical ways to approach the compensated length of T by a sequence of 
lengths of finite trees. 
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Figure 5: Kingman's natural coupling: In the tree T coming down from infinitely many leaves, 
a tree with N leaves can be embedded by working down from the first N leaves (obtained by 
sampling) from the infinite tree. The resulting tree length is . In the natural coupling, Kf is 
the number of lines in the small tree while the full tree has i lines. 

1. From the root to the leaves: Let -R(i), R(2)i ■ ■ ■ be a listing of the set {R{i, j) : i, j € N, i =/= j} 
in decreasing order. The random variable 

X k := - R(k)) (3.2) 

then gives the time the tree T spends with k lines "in parallel", k = 2,3, .... We set 

N 

A? :=^fcX fe -21ogN, N = 2,3,... 

k=2 

The random sequence (Af )n=2,3,... is called temporal coupling in [2T]; see also Figure[4] 

2. Across lineages: For N = 2,3, consider the finite subtree 

T N encoded by ({1,..., TV}, i?| {1Ar}2 ) , N=l,2.... (3.3) 

Define 

^2 '■— £(T N ) — 21ogN, N = 2,3,... 
This random sequence is called natural coupling in |21j ; see also Figure [5] 
Proposition 3.1. There is a random variable A = A(R) such that |A is Gumbel distributed, 

A N A (3 4) 

almost surely and in L? and 

A N A (3 5) 

in L 2 . 
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Figure 6: Detail of a look-down graph. Time is running upwards; all lines at the first 8 levels 
are drawn between times and t. At times in Vij an arrow is drawn from i to j, and all lines at 
levels > j are pushed upwards as indicated by bent lines. The dashed line initiates one partition 
element of the partition Q of R x {2, 3, . . .}. The solid lines constitute the tree 7^ ld:S of the first 8 
ancestral lineages back from time t. The sum of the lengths of the solid lines is the tree length of 
the first 8 levels at time t. 



3.2 The evolution of tree lengths 

For the extension of Theorem [l] we briefly review the lookdown construction of [11] ; see also [23] 
for a detailed description and Figure [6] for an illustration. 

Consider the set of vertices V := R x N. A vertex (t, i) is referred to as the individual at time t 
at level i. The source of randomness in the lookdown construction is a family of rate one Poisson 
processes (Vij)x<i<j ■ At times t € Vij, the individual at level j looks down to level i. As the 
illustration in Figure [6] shows, at a lookdown event in Vij, all individuals at levels k > j are pushed 
one level up, and a new line of ascent is born at level j. Note that the individual at level k is 
pushed to level k + 1 at rate ( 2 ) . 

We define the partition Q of R x {2, 3, . . .} into lines of ascent as follows. Every sq G Vij gives 
rise to a partition element G of the form 

([so, «i) x {j} U ([»!, s 2 ) x {j + 1}) U ([s 2 , s 3 ) x {j + 2}) U ... 

with Sk+i > Sk for all k — 0, 1, 2, .... Here, Sk+i is the smallest element in [sfc, oo)nlJ 1<i , <:( -, <fe+:( Tfyj-' 
We say that G is born by (s ,i) and pushed one level up at times sj, S2, .... If (so?*) G G i we 
say that G descends from G' . Since the individual at level k is pushed up by one at rate ( 2 ) and 
hence, for G as above, limfc_ i . 00 Sk is finite. 

For s < t, i < j let G,G' € Q be such that (s, i) € G', (t, j) € G. We say that (s, i) is ancestor 
of (i, j) if either G = G' or there are Gi, G„ such that G descends from G„, Gfc descends from 
Gfc_i, k — 2, ...,n and Gi descends from G'. In this case, we define A s (t,j) :— i. In addition, for 



igR, define the random metric R\ d (compare with (3.1 1) on N by 

R\ d (i,j) = 2 • inf{< - s : A a (t,i) = A s (t,j)}. 
We define the random trees 

^ N := ({l,...,N},^ d | {1 ,..., N}2 ), N = 2,3, 



and 



if/ d - N :=£(7; ld ' N )-21ogN. 



Recall the complete Skorokhod metric dgk on D from [THl Section 3.5]. We are now ready to state 
a result extending Theorem [T] to convergence in probability, proved in Section [6] 
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Proposition 3.2. There is a process Jz? ld , having the same distribution as Jz? from Theorem^ 
such that 



in probability. 



4 Auxiliary results on Kingman's coalescent and the Moran 
model 



In this section we collect some facts on Kingman's coalescent (Subsections 
model (Subsection 4.6 1 which will be required for the proof of Theorem [l 



4TfHj)| and the Moran 



For the Kingman coalescent, we take the tree T as introduced in Section [3] Recall the subtrees 
T N ,N = 2,3,... as defined in (3.3) and the inter-coalescence times X 2 ,X 3 ,... from (3.2). Recall 



that Xk is exponentially distributed with parameter • 

4.1 The Markov Chain K N 

We define 

Kf :— number of lines in T N while T has i lines (4-1) 
and K N := (Kf)i=i ;2 ,...\ see also Figure [H] Note that 

oo 

A*=^#fJQ-21ogN. ( 4 - 2 ) 

i=2 

The connection between the trees T N and T has been described e.g. by [28], [26] and [El Section 
4.4]. Lemma 4.8 of [TS] states that K" builds a Markov chain with one- and two-dimensional 
distributions 



P[^ N = k} = \n+T_T , * > 1, 1 < k < N (4.3) 



/N-k\ (j+k-l\ 

F[Kf = £\K? = k] = Vn "^ + V /_ + '" 1 1 , l<i<j,l<fe<^<N. (4.4) 

VN+i-l/ 

We will need some moment properties of this Markov chain. We leave out the straightforward 
details of the proof. 

Lemma 4.1. For i > 1, 

and for 1 < i < j 

wrr irN V , ^ ^ - W - 1) , ^ - l)N(N - 1) 

E[(, - K, )(j - Kj )] = (N + i _ 1)(N + j _ 1} + (N + J -_ 1)(N + i _ 1)(N + i _ 2) - ( 4 -6) 

4.2 The length of an external branch 

We will recall several facts of the length of a randomly chosen external branch in an N-coalescent 
T N . In this setting we take the inter-coalescence times X 2 , such that Xf is exponentially 

distributed with rate Q), i = 2, ...,N. We denote by J N the length of a randomly chosen external 
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branch. The results we describe in this section are collected from [IS] . )13j . [5] and [6] and stated 
here for completeness. 
We define 

N 

F N :=f iff X ^ 

k=f+i 

i.e. _F N denotes the number of lines extant in the N-coalescent at the time at which the external 
branch connects to the tree. We give a basic fact about -F N and properties of J N . 

Proposition 4.2 (External branches). For f = 1, ...,N — 1, 



F[F N <f] = ^ '- i.e. F[F N = f] 



N(N-l) N(N-l) 
The first two moments of J N are given by 

e[J n ] = |, 



V[J 



N(N- 1) 



Proof. We obtain the distribution of F N as follows: With probability \n\ the randomly chosen 

external branch is not involved in the first coalescence event (bringing the number of lines from 
N down to N — 1). Iterating this argument, we immediately see that the probability that the 
randomly chosen line did not take part in the first N — / + 1 coalescence events is 



[F N < /] = 



(V) (V) (O _©_/(/-!) 



(?) (V) (V) (2) N ( N -D' 

To compute moments of J N , we use the representation 

J N = £ 

Recalling that X2, are independent of the tree topology, and F N is measurable with respect 

to the tree topology, we get that F^jX™, ■■■,X^ are independent. The first two moments of J N 
are now obtained by 
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which also implies that E[-^] = ^ and 



N 

V[J N ] = E[V[J N |F N ]] + V[E[J N |F N ]] = E[ T 



fc=F N + 



2/ 



N-1 N 

E E N(N - I I (i.y 

f=i k=f+i K > [ 2 ) ./-i 

N fc-1 , . N-1 



1 N_1 

i)7^ + 4 ^ 



2/ 



N(N-l)/ 2 N 2 



/n\ ^ *2 ^ /m Z^ J n n 2 

^ fe=2/=l ( 2 ) \2) f=l J 



N-1 

»(£ 7) 



1G 



N 2 N(N-1)\^// N 2 

^N 1 



8ELij-12+^ 
N(N — 1) 





r 2 i 


+ v 


.F N . 


16 




' n 2 





□ 



4.3 Subtrees of coalescents and their lengths 

The aim of this section is to analyze the difference of the tree lengths of T and of T N . Since this 
difference is infinite, we have to carry out a limiting procedure, compensating by the mean. Recall 
that the inter-coalescence times X2,X$, ... are independent of the tree topology of T in general 
and of K2 , , ... in particular. We study the random variable 



00 

B N := ]T ((i - Kf)Xi - E[i - Kf] ■ E[X t \ 

i=2 



(4.7) 



which is the compensated difference of the tree lengths of T and T N . 

Remark 4.3. Let us first make sure that the infinite sum in the definition of Bn exists and has 
expectation zero. To see this, we fix M G {2,3,...} and consider the sequences (-Bn,m)m=2,3,... 
where -Bn,m is defined as -Bn but with the sum ranging from i = 2 to i = M. For M < M', by 
and using that E[XiX 3 ] < 2 • E[Xi] ■ E[Xj] for all ij = 2, 3, ... 

E[(-Bn,m' — -Bn,m) ] 

m' _ 

= E [( E 



Lemma 



4.1 



< 



< 8 



M' 



i=M+l 
M' 



i-Kf 
(i-1) 

1 



N + i 
1 

N + i 



i=M+l j=i 
M' M' 

E Zv jtj - iW+1 - 1) 

i=M+l j=t JU ;v ; 



i(i - 1) 



1 

n+7 



< 8 



M' 



1 



^ (i-1) 2 
=M+1 v ; 



^(m) 



as M 



Hence, the sequence (Bn,m)m=2,3,... is Cauchy in I? and thus converges in L 2 to the limit variable 
-Bn defined in (4.7 1. Furthermore we obtain that E[i?jy] = by continuity of the linear functional 
E on L 2 . 

Proposition 4.4 (Variance of the difference in length of an infinite and a finite coalescent in the 
natural coupling). 

E[ ^ ]N ^8JogN (4g) 
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Proof. First, 

E[X i X j ]=E[X i \-E[X j ]-(l + S ij ). 



In order to obtain (4.8), by a straightforward calculation using Lemma 4.1 



3n N(N-1 



, j(j-l)(N + j-l)(N + i-l)(N + i-2) 

3-3 1-2 ^ ^ 

+ x , N(N-l) 



EG 



^ Vi(i- l)(N + i- l) 2 (N + i-2) (N + i-1)- 

It is clear that the expression in the last line tends to as C/N, for some C > 0, as N — > oo. For 
the expression in the next to last line, we obtain 

~ N(N-l) 1 1 

4^J'0"-l)(N + i-l)^VN + i-2 N + i-1 



(N-1)(j-2) 



( . =3 j(j-l)(N + j-l)(N + j-2) 

N N^oo „ 1 N^oo 8^/1 1 \ N^oo 8 log N 



N 



which proves the Proposition. □ 



4.4 Numbers of ancestors near the tree top 

Let u > 0. Define S*^ 1 to be the (random) number of ancestors at time — u in T N and S u the 
number of ancestors by time —u in T, where 7 ~ N a nd T are defined as in Section § We give 
results on convergence of S^ as N — > oo (Lemma 4.5 ) and on convergence of S u as u — > (Lemma 



4.6). 



Lemma 4.5. For u > 0, 



almost surely and in LP for all p > 0. 



Proof. With probability one, the completion of i? from (3.1) is compact, so T comes down from 
infinity, i.e. with probability one there are at most finitely many lines left by time —u. Since S^ 
is increasing with N, the almost sure convergence follows. To see the L p -convergence, note that 
all moments of the distribution of S u exist; see e.g. j25J Section 5.4]. Since is bounded by S u , 
the families ((S'^) P ) N _ 1 „ are uniformly integrable and convergence in LP follows. □ 

Lemma 4.6. For the number of ancestors S u 

u ■ S u ^> 2, (4.10) 

almost surely and in L 2 . Moreover, 

S u - 2/u _ 



In addition, for u, v — > 0, u < v, u/v — > V < 1 



N(0,1). (4.11) 



(4.12) 
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with the covariance matrix C given by 



C = 



i r 3 / 2 
r3 /2 1 



(4.13) 



Remark 4.7. As a consequence of the previous lemma, the finite dimensional distributions of 
((5t« — ^)/-\/2/(3tu)) t>0 converge as u — > to those of a Gaussian process (A t ) t >o with covariance 

,A t ] = (s/i) 3/2 ~for s < t. 



Proof of Lemma \4-.6\ The convergences (4.101 and (4.11) can be found on p. 27 in pQ. For further 
use below, we prove (4.11) in all detail. Define T n as the time it takes the coalescent to come 
down to n lines, i.e., 



i— 71+1 



Note that 



i ! - 1) n i^u — 1H 



4 

3n3' 



(4.14) 



The central ingredients in the proof are the two facts 

P[S u <n}=P[T n <u], 



2/n 



(4.15) 



where the second assertion is a consequence of (4.14) and the central limit theorem. Hence we 
may define 



a u (x) := [2/u + xy/2/{3u)\ : 



(4.16) 



and write 



S u - 2/u 



< x 



= ¥[S U < a u (x)} 

= nTa u (x) < «] 

T a u (x) - 2/a u (x) u-2/a u (x) 



^A/(3a u (xr) ~ v/4/(3a u (x) 3 ) 

ia\Ta u (x) — 2/a„(x) 



< X 



since 



2/a u (x) x^f&, ^A/Z{a u {xY) 



(4.17) 



Now, (4.11) follows from (4.15). Since the event in ( 4.10[ ) is measurable with respect to the 
terminal a- algebra generated by the independent random variables X2, X3, the convergence in 
( |4.10[ ) holds almost surely. The L 2 -convergence follows from moment results for S u given e.g. in 
Bisection 5.4]. 

Let us turn to the proof of (4.12). Since for m < n 



T m ,T n ] = J2 V K 



i=n-\-l 



4 

3n?' 
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and using (4.14), it is an easy exercise to show that for m, n — > oo, m/n — > T < 1, 

/ \ c) (41g) 

V 4 /(3« 3 ) V 4 /(3m 3 )/ 

with the covariance matrix C given in ( |4.13[ ). To see (4.12) from this, note that for m < n, u < v, 
an extension of (4.15) gives 

P[T„ < u, T m < v] = P[S U <n,S v < m] 
and thus for x,y € K., and using ( |4.16 1, 

= P[S U < a u {x), S v < a v (y)} 



Su - 2/u S v - 2/v ' 



T a u (x) ~ 2/a u (x) u - 2/a u (x) T a j y) - 2/a v (y) ^ v- 2/a v (y) 



< 



~ v/4a u (a;) 3 /3' ^Aa v (y) 3 /3 " v /4a„(y) 3 /3 

T a u (x) - 2/a u (x) T a r y \ -2/a v (y) ■ 
< x, . . , . — < y 



^/Aa u {xf/i 



□ 



by (4.17) and thus, if u, v — > 0,u/v -t T < 1, (4.12) follows from (4.18) since a v (x)/a u {y) 
u/v. 

4.5 The tree length near the tree top 

We analyze now the contribution to the tree length that comes from a small time interval near 
the tree top. To this purpose we define 



A*:= / (S„ N -E[^ N ])d«, A u := / (S v — E[S v ])dv. 
Jo Jo 



(4.19) 



Note that A^ equals A^ u - E[i£ J from Figure E in distribution. Again, we give results on 
convergence of A^ as N — > oo (Lemma 4.9) and of A u as u — > (Lemma 4.10). 

Remark 4.8 (A u as an L -limit). Since the integrand in the definition of A u is unbounded, we 
have to make sure that the random variable A u exists. Indeed, using Lemma |4.6| it is easy to check 
that ( J^_ n (S v — E[S v ])dv) is a Cauchy sequence in L 2 , and we define A u as its L 2 -limit. In 

particular, by continuity of E on L 2 and Fubini's Theorem we further obtain E[A„] = E[A^] = 0. 

Lemma 4.9. For the random variables A^ and A,,, 



(4.20) 



Proof. We start with proving the intuitively obvious fact that S v — and S w — 5^ have nonneg- 
ative correlation. For w < v , we write 



E[SV> S v \S V ,S W ],K[S W S w \S v ,S w ]j 

+ E [COV[5„ - S*, S w - S*\S V ,S V 



S V (S V 1) S W (S W 1) 
N + S v - 1 ' N + S w - 1 



E 



S„(S„-1)N(N-1) 



(N + S w - 1)(N + S V - 1)(N + S v - 2) 



(4.21) 
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where we have used (4.3) for the first and (4.4) for the second term. The second term on the r.h.s. 
is nonnegative, and so is the first term, since i t— > j^fezi is increasing and (S v , S w ) are associated, 



i.e. 



COY[f(S v ),g(S w )}>0 



(4.22) 



for all non-decreasing functions /, g. Indeed, to verify (4.22) it is enough to show this inequality 
for f(S v ) = ls v >i and g(S w ) — ls m >k- This, however, is clear since 

COV[l {Sa >t } ,l {Sv > k} ] =COV[l {Sro > £} ,E[l {Si ,> fe} |S lu ]] > 

by the well-known fact that a single random variable (here S w ) is associated and both l{s„>Q 
and E[l{5 ij>fe }|5 l „] are non-decreasing functions of S w . So we have proved that 

COY[S v - , S w - S%] > (4.23) 

for all v,w > 0. 

Now we come to the proof of ( |4.20[ ). By Fubini's Theorem and (4.23), 

E[(A* - A u ) 2 ] = 2 p f E[(S* - S v - E{S* - S v }) (S% - S w - E[S% - S w ])) dwdv 
Jo Jo 

?^ - S v , S w - S^]dwdv 



= 2 



< 2 



o Jo 

oo rv 



[S v - Sf , S w - S"]dwdv = E[(A£ - A^) 2 ] 



±>0 



o Jo 

X 



by Proposition 4.4 since (A^, — Aoo) 2 is distributed as in that Proposition. 



□ 



Lemma 4.10. For the random variables A u , 

V[A„ 

Proof. By Lemma |4.6| we see that for w < v 

w<v— f 



«->0 



(4.24) 



v-i $u 



2 2 w 3 _ 2 w 
3w 3w v 3 3 v 2 



Hence, by Fubini's Theorem, 



V[A, 



o 



,, S v ]dwdv 



3 Jo 



r dwdv 



□ 



Remark 4.11. Note that A„ is an integral over approximately Gaussian random variables for 
small u. In addition, for s < t, 



A s 



3 

u->0 



4,su \ itu 



2uv st v 

.s 1 

t ' 2 



2 



2uV st Jo 

su P v 2w 



dwdv + 
o • iv ^ Jo 



COV[S v ,S w ]dwdv 



wdw 



rdv 



3/7 I s 3 
2Vt~ 2V t 3 ' 



Hence, as an extension of ( 4.24 ) , we see that the finite dimensional distributions of (j^-tu/ 
converge as u — > to those of a centered Gaussian process (A t )t>o with covariance 



rtu 



t>0 



„A t ] = 
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4.6 The evolution of the / oldest families in the Moran model 

Consider the graphical representation of a Moran model given in Figure [T] For any time t £ K 
the tree can be identified with a random subset of (— oo;i] x {1, ...,N} which we continue to 
denote by 5j N . For any igRwe define the inter-coalescence times 

Xf(t) := length of the time interval in which has i lines (4-25) 

for i = 2, ...,N. We denote by ^ N the rate ( 2 ) Poisson process of all resampling events, viewed 
as a random subset of K. For / = 2, ...,N and t £ R, consider the / oldest families at time t, i.e. 
the / subtrees of ^ N (call them F t N (l), . . . , F t N (/)) whose union is ^ n ({[t - Ef=/+i Xf;t] x 
{1,...,N}). Let 

# N -' := {t : Xf(t) ^ Xf{t-)} C 

i.e. f^ N ^ is the point process of times when one of the / oldest families gets extinct. We set 
Z?<f = (Z?'f(l),...,Z?<f(f)), where 

Z t (i) := size of the family Ff(i), 

i.e. the number of leaves in the tree Ff(i). Note that zf'^ £ Z^ +N := {(zi, . . . , Zf) : z, > 
+ . . . + Zf = N}. The following lemma is essential in the proof of Theorem [I] 

Lemma 4.12 (The / oldest families in a Moran model). Fix f £ {2, N}. LetS^ = {t„ : n £ 1} 
with ... < t < Tl < .... 

1. The uniform distribution on Z^ +N is the stationary distribution for the Markov chains 
(Zf ,f ) t( z R and (Z^ / )„ eZ . 

2. The events ({r„ £ ^ N, ^})nez are independent and P[r„ £ S^'f] — (l)/^)- ^ n particular, 
Qi^'f is a Poisson process with rate ( 2 ). 

Proof. Let t £ R. Looking at the Moran genealogy during the time interval \t — 5^iL/+i -^"fWi ^] ; 
one sees that the / oldest families have been built up through a Polya urn with / ancestral balls. 
This already explains 1. for a fixed (non-random) t. The same arguments apply if t is a resampling 
time. Hence, we have proved assertion 1. 

Let us now consider the dynamics of the sizes of the / oldest families {Z^'') n ^z in its equilib- 
rium, the uniform distribution on Z^ +N . The Moran model is set up as follows: Each pair of balls 
is chosen at unit rate. As soon as a pair is chosen, one of the two balls (selected at random from 
the pair) is transferred into the box of the other ball. (If two balls from the same box are chosen, 
nothing changes.) For the transition of the Markov chain, we compute for [zi, zf) £ 7lJ 



f 

v ' - z f ),r n+1 i ® N -f] = J2 v K: f = *i + 1. sj - 1. - z f) 



l+l 



:,i=i 



++N 

\( Zl + l){ Zj -l) 



f 

E 

i=l 



J2nz% f = (z 1 ,...,z i ,...,z j ,...,z f )M 



(2) 

p[z^y = (*,...,*,)] (($»* + j> 



•? - !) - E Zi 

\2) i=l >,3=i i=l 



= (i-^).p[z?y = (z 1 ,...,« / )]. 



This calculation reveals two things: first, by summing over all possible [zi, zj) on both sides, we 
see that P[t„+i £ 2>] = ({)/(^)- Secondly, we see that Z^^, given r n+ i £ $$, is again uniformly 
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distributed on The last assertion implies that ({r„ € ^ N '^})nez is independent, which 

finishes the proof of assertion 2. 

□ 



5 Proof of Theorem [T] 

We will prove Theorem [l] in three steps. For convergence of jSf N we need to show (see e.g. [To] 
Lemma 3.4.3]) 

(a) The sequence of processes Jzf N is tight in D, 

(b) The finite-dimensional distributions of the sequence Jzf N converge. 

The main work is to show tightness of (,£? N )NeN m D. For this, it is enough to show (see [TBI 
Theorem 3.8.6 and Theorem 3.8.81) 



(Jzf N ) Ne N is tight in R 
and there exists /3 > and 9 > 1 such thaij^jfor all (eR, 

limsu P E[l A |JS*N _ j^/J A _ ^ |/J] < h e 



(5.1) 



(5.2) 



Proposition |2. l| alr eady shows that converges to a Gumbel distributed random variable, 

which implies d5.l|. Now, the main work is to show (5.2), which will be done in Step 1. In Step 



2 we show convergence of finite-dimensional distributions. Step 3 then shows (2.2) 



Step 1 (Proof of (5.2)). Consider a Moran model of size N. We will use the same notation as in 



Subsection |4.6| For any two time points t and t + h, we put 

F™ t+h] := mm{i = 2, N : Xf(t) jt xf(t + h)}. 

Note that {F™ t+h , > /} is the event that none of the / oldest families gets extinct during (t;t + h\. 

Since the point process of losses of one of the / oldest families is Poisson with rate by Lemma 
14.121 we conclude that 



P[*$-M V F^ t+h] </] = (!- e-tt) h f < fh 2 A 1. 



(5.3) 



Using this equation, we will now show (|5.2[) for 6 



and /3 = 10. We write 



?N \10i 



e[i a {^ +h - ^ t N r a - se* H ) 

< E[l A {t£? +h - J2f ) 10 ; F™ t+h] > f$_ M] ] + E[l A (j*f t N - J^,) 10 ; F$_ M > F^ t+h] ]. ( ' } 

We will next bound the first term on the right hand; the bound for the second term is obtained 
in the same manner. We get 

E[lA(J?Z h -J?n W ;F$, t+h] >F$_ htt] ] 

] < h- 2 ^ 

< h- 2 ' 9 



< P[J$_ M < F^ t+h] < h- 2 /»}+El(^ +h -^r;F^ t+h] > h- 2 /»] 

< F{F^_ h t] V F^ t t+h] 



•E 



[( 



N 



^iFX^t + h)-^ 



N 
i=2 



' r (t,t+h] 



(5.5) 



<> 1Q / 9 +E 



N 

i=L/l- 2 /9j 



i=[h- 2 / 9 \ 



(I) 



10 



a For functions a, 6 : — > R_)_ we write a(ft) < b(h) iff there is a C > 0, independent of any other parameter, 
such that a(h) < C ■ b(h) for all h > 0. 
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where the last inequality follows from (5.3) and the fact that Xi(t + h) = Xi(t) on {Fu f+ /ji > 
h~ 2 / 9 } for all i < [h- 2 / 9 \ by definition of F% t+h] . It remains to bound the second term in the 
last line. For this, we define for n = 1, 2, ... 



a n (h) :- 



E E 



and observe that 



ai(h)=0, 



(5.6) 



since the nth central moment of an exponentially distributed random variable with parameter A is 
proportional to A - ". In addition, we use that (a: — y) n < (2x) n + (2y) n for even n and all x, y € K, 



and independence of Xi£ (t), (i) as well as of (< + h), 



, -XjJ (t + /i) to obtain 



E 



4?l- 2 /9J 



(!) 



<E 



10 

^E 

fe=i 



E 

4?l- 2 /9J 



N 

E 

= |>- 2 / 9 j 

10 



(!) 



(!) 



E 



10 



E 

4?l- 2 /9j 



(!) 



E a »i w 



; W<(a 2 (M) 5 <^ 10/9 - 



(5.7) 



H + ...+n fc =10 



by (|5_6j). Plugging (|5_7|) and (5_5j) into (|5_4|) shows that (|5_2j) holds with /3 = 10 and 9 = f. 

Step 2 (Convergence of finite-dimensional distributions). Fix t\ < ... < t n . We will show that 
(Jz?^, -S? t ) converges weakly for N — > oo. The strategy is to define a probability space on which 
all j£^ N , i = l, n, N = 2, 3, are defined. 

Consider a coalescent, started with infinitely many lines which are numbered by 1„, 2„, ... for 
some time t n —t n —\. Denote the number of the ancestors at time t n — t n — i by S n . Number the lines 
going back from these by l n _i, (S n ) n -i and augment them by lines numbered (S n +l) n -i, (S n + 
2) n _!, .... Let these infinitely many lines coalesce for some time i„_i — t n -2i number the S n —\ 
ancestors at time t„_i — t n -2 by l n _2, (<Sn-i)n-2 and augment their lines by lines numbered 
(S n -i + l)n-2, (S n -i + 2)n_2,---- In this way we get iteratively n genealogies for an infinite 
population back from times tx, ...,t n . Considering the compensated tree lengths of lines numbered 
Lt,...,Nj gives the compensated tree length of a population of size N at time ti, i = l,...,n. 
Moreover, as shown in Proposition 3.1 these tree lengths converge in L 2 as N — > oo for each 
i = 1, Since L 2 -convergence implies convergence in probability, which, in turn, implies weak 

convergence, we are done. 



Step 3 (Decomposition of Jz? t — _Sf and proof of (2.2)). Recall the graphical representation of a 
Mo ran model from Figure]!] Using the random set C (— oo,t] x {1, ...,N} (recall Subsection 

4.6), we have the representation (Jzf t N ) teK = (A N (5j N ) — 2 logN) t6K where A N is Lebesgue measure 
on Ex {1,...,N}. We set 



A 0,t 



Bl :=A N (^ N \^ N ), 



4 N 
A 0,t 



4 N 



E[M t ], 



nN nN 

B o,t ■- B 0,t 



(5- 



compare also with Figure § Note that E[A N (5J N \ ^ K 
and thus 



E[A N (^ N \ ^ t N )] due to stationarity, 



-^0 ~~ ^0,t 



5, 



0,f 



(5.9) 
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(5.9) that 



For the infinitesimal variance, we find by the convergence of finite dimensional distributions and 
E[(jgf t ~ ^o) 2 ] = lim E[(jS? t N - ^f] = lim E[« f - < t ) 2 ]. (5.10) 



From and ( |4.19[ ) we conclude that Aff , = Af , again see Figure 3 We have, using the 

L 2 -convergence from Lemma 4.9 



and Lemma 



4.10 



lim V[Aj? t ] = lim V[Af] = V[A t ] *~° U 



N->oo 



N-voo 



(5.11) 



For the variance of -B^ t , note that B$ t = £?n — BgN, where and are as in Sections 



4.4 



and 



Proposition |4. 4 



4.3 



with T := ,% and independent. We thus have for fixed t and N — y oo, because of 



V[< t ] = Y[B N - B sf ] 



N-s-oo . 



N->oo 



(5.12) 



4.4 (4.8), that 



Since t ■ S t —> 2 almost surely as t —> (see Lemma 4.6), we conclude from (5.12) and Proposition 

(5.13) 



lim 

N->oo 



ol/] 



t-yO 



St J 



L 2j] *~°4t| log*! 



Finally, combining (5.10), (5.11) and (5.13), and noting that 



h>Kt] < Vfllogtl by 



the Cauchy-Schwartz inequality, we arrive at 



E[(if t -Jf ) 2 ] 4t|logt| 



This completes the proof of Theorem [T] 

6 Proofs of strong convergence results 

In this section we prove Propositions |3.1| and |3.2| 



6.1 Proof of Proposition 3.1 



From Proposition 2.1 we know that converges weakly as N — > oo to a random variable A such 
that |A is Gumbel distributed. Since is a sum of independent random variables, Kolmogorov's 
three series criterion shows that the convergence holds almost surely as well. Moreover, since 



seconds moments converge in (3.4), the convergence also holds in L 2 . 
Next, we will show that 

A N _ A N N-^ Q 



in L . Together with the L -convergence of A^ this gives (3.5). We compute directly, recalling 
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(4.2) and using that A N = A N in the third equality and Lemma 4.1 

oo N 

E[(A«-A?) 2 ]=E[(5>r*-£i* 



2=2 
OO 

= E [(E 

i=2 



= 2 ( E [(E 

i — 

N 

'[E 



z l ( ^ ;.\ 

i=2 
oo N 



[(E 



oo N 



2E [EE^ N ^^ 



i=2 " i—1 j=2 

N _ N 

= 2 (V | + ( E [ 

i=2 i-2 
N-1 , N-1 



[£**]) -EE 



i=2 j=2 

yN 



oo N 



N + *-li(i-l)iO'-l) 

2 ( 4 Ei2 + ( 2 E7) ~ 4 E 7 E -i- 1\ ~ 4 E 72 



4=2 j=2 
N-1 , oo 



iy(l + *«)) 



N 



4=1 
N-1 



E ,-2 

i=l 

N-1 

E 



i 



N 



j 

N-1 



) 



/- \ N + i 

= 8 N ' 
' i(N + i) N — - / 



N-1 

E 



Ej) 8 E 7 -E(j n -\ 

i=l j = l J i=l 

N-j-oo 8l0gN N 



1 



N 



> 0. 



6.2 Proof of Proposition |3.2 

We recall [TU1 Lemma A2.1]: 

Proposition 6.1. Let (X n ) n= i t 2,... be a sequence of processes with sample paths in D, defined on 
the same probability space. Suppose that (-X" n )n=i,2,... * s relatively compact in D (in the sense of 
convergence in distribution) and that for a dense set H CM., {X™) n= \ t 2,... converges in probability 
in K for each t G H . Then, there is a process X such that dsk(X n , X) "~ > °°> in probability. 

We use this Proposition for (i? ld ' N ) N =2,3,... • First, «5f ld < N = J^ N with «if N as in Theorem 
[lj Hence, as Theorem [l] shows, (Jz? ld,N )N=2.3,... converges weakly. In particular, the sequence is 
relatively compact in D. 



For all t € R, we have that & 



ld,N 



^ N with ST^ from (Q. Consequently, .5f lcMN = A£ and 

old,N 



5ld,N 
-2 + u Q r2 



in L . Since the L -convergence 



there exists a random variable Jz? t ld such that Jz? t ' — Jz? t ld - 
implies convergence in probability we have proved Proposition |3.2| 
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